Replication Archive for "Women�s Authority in Patriarchal Social Movements: The Case of Female Salafi Preachers"
American Journal of Political Science
Richard Nielsen

This archive last modified on 3/7/2019.

##############################
## Downloading the archive: ##
##############################

Because of the Dataverse file size constraints, the archive is all in a single zipped directory
EXCEPT for a single file, "stemListHolder_4feb2019.rds".  This file is necessary if you want
to run the scripts below quickly.  It is one of several saved intermediate objects that save
time (rather than taking hours to create them, I have saved them and you may load the saved version).

After downloading and extracting the archive to your desired directory, save the file
"stemListHolder_4feb2019.rds" in the subdirectory ~/archive_daeyat/savedObjects.


###########################
## Computer Requirements ##
###########################

I used a Windows 10 machine with the following specifications:
Processor: Intel(R) Xeon(R) W-2145 CPU @ 3.70GHz
RAM: 32.0 GB
System type: 64-bit Operating System, x64-based processor

I used several versions of R, but all of the replication archive scripts
were checked in R 3.5.2 (the latest as of this writing).

################################
## Main Scripts (start here): ##
################################

~/scripts/daeyat_analysis.R -- This is the main analysis script.  It both combines the data 
                               from various sources and executes the analysis.  This script
                               shows everything I did, not just the results that made it into
                               the paper.  For this reason, there are many extra analyses and
                               notes to myself.  My intent is that these might be useful to
                               understand everything I tested, not just the tests that were
                               important enough to report in the paper.

                               This script creates many large objects that are time-consuming.
                               To save time, I have saved these objects as part of the replication
                               archive (in the subdirectory ~/savedObjects).  Near the top of 
                               ~/scripts/daeyat_analysis.R, the line:
                                     useSavedObjects <- TRUE
                               specifies that you are using these saved objects.  If you prefer
                               to not use the saved objects, uncomment the line immediately 
                               following:
                                     # useSavedObjects <- FALSE  ## Uncomment this line to run everything yourself
                               If you are not using saved objects, this script will take at least
                               24 hours to run on a reasonably powerful computer.


~/scripts/analyze_archive.org.R -- This script processes the raw data I scraped from archive.org
                                   showing when each preacher was added to the saaid.net website.
                                   This script creates a variable that is used in the script
                                   ~/scripts/daeyat_analysis.R and produces Figure 1 of the
                                   web appendix.

########################
## Auxiliary Scripts: ##
########################

~/scripts/scrape archive.org.R -- This script WILL NOT RUN.  It is the code that scrapes the website
                                  archive.org.  The filepaths are for my local machine where I 
                                  originally saved the data, and the scraping code is partially
                                  commented out to avoid overwhelming the website's capacity.  I
                                  Include it here so you can see how I scraped the data.

~/scripts/ingest_raw_files.R -- This script WILL NOT RUN.  It is the code that ingests the raw text
                                data I scraped from www.saaid.net.  The raw text files are 16 GB, so
                                including them in the replication archive isn't feasible.  The texts
                                are saved in an .rds object called in ~/scripts/daeyat_analysis.R.

~/scripts/scrape_twitter.R -- This script WILL NOT RUN.  It is the code I used to scrape Twitter and
                              it requires my personal API keys to run, which I am not including. I
                              include the script here for completeness so that readers can see how I
                              collected the data and recollect it themselves with their own API keys.

~/scripts/getTimeline.R    -- This is an script containing a function for getting a Twitter user's 
                              timeline.  It is called in ~/scripts/scrape_twitter.R.



Files:

A complete file list of the archive is infeasible, because it currently contains over 1,000 files.


